Optimal Algorithms for Testing Closeness of Discrete Distributions | Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics
نویسندگان
چکیده
We study the question of closeness testing for two discrete distributions. More precisely, given samples from two distributions p and q over an n-element set, we wish to distinguish whether p = q versus p is at least ε-far from q, in either `1 or `2 distance. Batu et al [BFR00, BFR13] gave the first sub-linear time algorithms for these problems, which matched the lower bounds of [Val11] up to a logarithmic factor in n, and a polynomial factor of ε. In this work, we present simple testers for both the `1 and `2 settings, with sample complexity that is information-theoretically optimal, to constant factors, both in the dependence on n, and the dependence on ε; for the `1 testing problem we establish that the sample complexity is Θ(max{n/ε, n/ε}).
منابع مشابه
Fair Scheduling via Iterative Quasi-Uniform Sampling | Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics
In the paper we consider minimizing the `k-norms of flow time on a single machine offline using a preemptive scheduler for k ≥ 1. We show the first O(1)approximation for the problem, improving upon the previous best O(log logP )-approximation by Bansal and Pruhs (FOCS 09 and SICOMP 14) where P is the ratio of the maximum job size to the minimum. Our main technical ingredient is a novel combinat...
متن کاملAn Efficient Algorithm for Computing High-Quality Paths amid Polygonal Obstacles | Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics
We study a path-planning problem amid a set O of obstacles in R, in which we wish to compute a short path between two points while also maintaining a high clearance from O; the clearance of a point is its distance from a nearest obstacle in O. Specifically, the problem asks for a path minimizing the reciprocal of the clearance integrated over the length of the path. We present the first polynom...
متن کاملCombinatorial Auctions via Posted Prices | Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics
We study anonymous posted price mechanisms for combinatorial auctions in a Bayesian framework. In a posted price mechanism, item prices are posted, then the consumers approach the seller sequentially in an arbitrary order, each purchasing her favorite bundle from among the unsold items at the posted prices. These mechanisms are simple, transparent and trivially dominant strategy incentive compa...
متن کاملOn Rationality of Nonnegative Matrix Factorization | Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics
Nonnegative matrix factorization (NMF) is the problem of decomposing a given nonnegative n × m matrix M into a product of a nonnegative n×d matrix W and a nonnegative d×m matrix H. NMF has a wide variety of applications, including bioinformatics, chemometrics, communication complexity, machine learning, polyhedral combinatorics, among many others. A longstanding open question, posed by Cohen an...
متن کاملTesting Soccer League Competition Algorithm in Comparison with Ten Popular Meta-heuristic Algorithms for Sizing Optimization of Truss Structures
Recently, many meta-heuristic algorithms are proposed for optimization of various problems. Some of them originally are presented for continuous optimization problems and some others are just applicable for discrete ones. In the literature, sizing optimization of truss structures is one of the discrete optimization problems which is solved by many meta-heuristic algorithms. In this paper, in or...
متن کامل